Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 14 |
| Number of observations | 786600 |
| Missing cells | 24767 |
| Missing cells (%) | 0.2% |
| Duplicate rows | 546 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 90.0 MiB |
| Average record size in memory | 120.0 B |
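The percentages above follow from simple counts: 24767 missing cells out of 786600 × 14 cells is about 0.2%, and 546 duplicate rows out of 786600 is about 0.1%. A minimal stdlib sketch of these counts is given here, assuming a table held as a list of tuples with `None` for a missing cell. The function name `overview` and the toy rows are hypothetical, and the report does not say exactly how duplicates were counted.

```python
from collections import Counter


def overview(rows, columns):
    """Summarise a table given as a list of equal-length tuples (None = missing cell)."""
    n_obs = len(rows)
    n_cells = n_obs * len(columns)
    missing = sum(value is None for row in rows for value in row)
    # Assumption: every repeat of a row after its first occurrence counts as a duplicate.
    duplicates = sum(count - 1 for count in Counter(rows).values())
    return {
        "n_variables": len(columns),
        "n_observations": n_obs,
        "missing_cells": missing,
        "missing_cells_pct": 100 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": 100 * duplicates / n_obs if n_obs else 0.0,
    }


# Hypothetical toy data, not taken from the profiled dataset.
columns = ("customer_id", "order_hour", "customer_order_rank")
rows = [
    ("a", 19, 1),
    ("a", 19, 1),     # exact duplicate of the first row
    ("b", 20, None),  # one missing cell
    ("c", 12, 1),
]
stats = overview(rows, columns)
```

With a pandas DataFrame `df`, comparable counts would come from `df.isna().sum().sum()` and `df.duplicated().sum()`.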
Variable types
| Type | Count |
|---|---|
| NUM | 10 |
| BOOL | 2 |
| CAT | 2 |
Warnings
| Warning | Type |
|---|---|
| Dataset has 546 (0.1%) duplicate rows | Duplicates |
| customer_id has a high cardinality: 245455 distinct values | High cardinality |
| order_date has a high cardinality: 776 distinct values | High cardinality |
| customer_order_rank has 24767 (3.1%) missing values | Missing |
| voucher_amount is highly skewed (γ1 = 30.39394065) | Skewed |
| platform_id is highly skewed (γ1 = -22.53663783) | Skewed |
| voucher_amount has 743462 (94.5%) zeros | Zeros |
| delivery_fee has 597536 (76.0%) zeros | Zeros |
Reproduction
| Item | Value |
|---|---|
| Analysis started | 2020-10-11 18:10:25.002419 |
| Analysis finished | 2020-10-11 18:13:23.872274 |
| Duration | 2 minutes and 58.87 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
customer_id
Categorical
| Distinct | 245455 |
|---|---|
| Distinct (%) | 31.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Value | Count | Frequency (%) | |
| 15edce943edd | 386 | < 0.1% | |
| 8745a335e9cf | 288 | < 0.1% | |
| d956116d863d | 286 | < 0.1% | |
| 0063666607bb | 273 | < 0.1% | |
| ae60dce05485 | 270 | < 0.1% | |
| a54a8e1579d4 | 254 | < 0.1% | |
| bebb751d49b8 | 253 | < 0.1% | |
| 26ed6389a3aa | 245 | < 0.1% | |
| ef6265f74aca | 229 | < 0.1% | |
| a333fb175a0c | 221 | < 0.1% | |
| Other values (245445) | 783895 | 99.7% |
Frequencies of value counts
Unique
| Unique | 145498 |
|---|---|
| Unique (%) | 18.5% |
Histogram of lengths of the category
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
order_date
Categorical
| Distinct | 776 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Value | Count | Frequency (%) | |
| 2017-01-01 | 4230 | 0.5% | |
| 2016-12-18 | 3395 | 0.4% | |
| 2017-02-26 | 3234 | 0.4% | |
| 2017-02-05 | 3218 | 0.4% | |
| 2017-02-12 | 3125 | 0.4% | |
| 2016-12-11 | 3100 | 0.4% | |
| 2016-12-04 | 3075 | 0.4% | |
| 2017-01-22 | 3005 | 0.4% | |
| 2017-01-29 | 3003 | 0.4% | |
| 2016-10-03 | 2999 | 0.4% | |
| Other values (766) | 754216 | 95.9% |
Frequencies of value counts
Unique
| Unique | 41 |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
order_hour
Real number (ℝ≥0)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.58879608 |
|---|---|
| Minimum | 0 |
| Maximum | 23 |
| Zeros | 4627 |
| Zeros (%) | 0.6% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 16 |
| median | 18 |
| Q3 | 20 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 4 |
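The derived rows in this table are plain differences: Range = 23 − 0 = 23 and IQR = Q3 − Q1 = 20 − 16 = 4. A small sketch of how such a summary could be computed with the standard library is given here, assuming linear interpolation between data points. The helper name `quantile_summary` is hypothetical, and the report does not state which interpolation rule it used.

```python
import statistics


def quantile_summary(values):
    """Return quartile-based statistics using linear interpolation between data points."""
    data = sorted(values)
    # method="inclusive" treats the data as containing its own minimum and maximum.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return {
        "minimum": data[0],
        "q1": q1,
        "median": median,
        "q3": q3,
        "maximum": data[-1],
        "range": data[-1] - data[0],
        "iqr": q3 - q1,
    }


# Hypothetical toy data, not taken from the profiled dataset.
summary = quantile_summary([1, 2, 3, 4, 5, 6, 7, 8, 9])
```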
Descriptive statistics
| Standard deviation | 3.357192477 |
|---|---|
| Coefficient of variation (CV) | 0.1908710785 |
| Kurtosis | 5.749711941 |
| Mean | 17.58879608 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -1.749088644 |
| Sum | 13835347 |
| Variance | 11.27074133 |
| Monotonicity | Not monotonic |
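Several of these figures are tied together: the variance is the square of the standard deviation (3.357192477² ≈ 11.2707), the coefficient of variation is the standard deviation divided by the mean (3.357192477 / 17.58879608 ≈ 0.1909), and the sum is the mean times the number of observations. The sketch below shows one way to compute the same kinds of statistics. The function name `describe` is hypothetical, and the skewness and kurtosis here are simple moment-based estimators without bias correction, which is an assumption; the profiling tool may apply corrections.

```python
import math
import statistics


def describe(values):
    """Compute basic descriptive statistics for a non-empty list of numbers."""
    n = len(values)
    mean = statistics.fmean(values)
    std = statistics.stdev(values)  # sample standard deviation (divides by n - 1)
    median = statistics.median(values)
    # Median absolute deviation: median distance from the median.
    mad = statistics.median([abs(v - median) for v in values])
    # Central moments (divide by n); assumption: no bias correction.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return {
        "sum": sum(values),
        "mean": mean,
        "std": std,
        "variance": std ** 2,
        "cv": std / mean,
        "mad": mad,
        "skewness": m3 / m2 ** 1.5,
        "kurtosis": m4 / m2 ** 2 - 3,  # excess kurtosis
    }


# Hypothetical toy data, not taken from the profiled dataset.
result = describe([2, 4, 4, 4, 5, 5, 7, 9])
```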
Histogram with fixed size bins (bins=24)
| Value | Count | Frequency (%) | |
| 19 | 134030 | 17.0% | |
| 18 | 129654 | 16.5% | |
| 20 | 108739 | 13.8% | |
| 17 | 90782 | 11.5% | |
| 21 | 68223 | 8.7% | |
| 16 | 48877 | 6.2% | |
| 15 | 34286 | 4.4% | |
| 22 | 33403 | 4.2% | |
| 13 | 31105 | 4.0% | |
| 14 | 30323 | 3.9% | |
| Other values (14) | 77178 | 9.8% |
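The table above lists the ten most frequent values and folds the remaining 14 distinct values into a single "Other values" row (786600 − 709422 = 77178). A minimal sketch of building such a table with `collections.Counter` is given here. The function name `common_values` and the toy input are hypothetical, and missing values are not handled.

```python
from collections import Counter


def common_values(values, top=10):
    """Return (value, count, percent) rows for the most frequent values plus an 'Other' row."""
    counts = Counter(values)
    total = len(values)
    most_common = counts.most_common(top)
    rows = [(value, count, 100 * count / total) for value, count in most_common]
    other_distinct = len(counts) - len(most_common)
    if other_distinct:
        other_count = total - sum(count for _, count in most_common)
        rows.append((f"Other values ({other_distinct})", other_count, 100 * other_count / total))
    return rows


# Hypothetical toy data, not taken from the profiled dataset.
hours = [19, 19, 19, 18, 18, 20, 17, 0]
table = common_values(hours, top=2)
```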
| Value | Count | Frequency (%) | |
| 0 | 4627 | 0.6% | |
| 1 | 2425 | 0.3% | |
| 2 | 1187 | 0.2% | |
| 3 | 443 | 0.1% | |
| 4 | 137 | < 0.1% |
| Value | Count | Frequency (%) | |
| 23 | 13832 | 1.8% | |
| 22 | 33403 | 4.2% | |
| 21 | 68223 | 8.7% | |
| 20 | 108739 | 13.8% | |
| 19 | 134030 | 17.0% |
customer_order_rank
Real number (ℝ≥0)
| Distinct | 369 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 24767 |
| Missing (%) | 3.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.436809642 |
|---|---|
| Minimum | 1 |
| Maximum | 369 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 10 |
| 95-th percentile | 39 |
| Maximum | 369 |
| Range | 368 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 17.77232218 |
|---|---|
| Coefficient of variation (CV) | 1.88329773 |
| Kurtosis | 49.04720204 |
| Mean | 9.436809642 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 5.494014541 |
| Sum | 7189273 |
| Variance | 315.8554356 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1 | 244937 | 31.1% | |
| 2 | 96641 | 12.3% | |
| 3 | 60532 | 7.7% | |
| 4 | 43681 | 5.6% | |
| 5 | 34036 | 4.3% | |
| 6 | 27603 | 3.5% | |
| 7 | 23049 | 2.9% | |
| 8 | 19696 | 2.5% | |
| 9 | 17013 | 2.2% | |
| 10 | 14889 | 1.9% | |
| Other values (359) | 179756 | 22.9% | |
| (Missing) | 24767 | 3.1% |
| Value | Count | Frequency (%) | |
| 1 | 244937 | 31.1% | |
| 2 | 96641 | 12.3% | |
| 3 | 60532 | 7.7% | |
| 4 | 43681 | 5.6% | |
| 5 | 34036 | 4.3% |
| Value | Count | Frequency (%) | |
| 369 | 1 | < 0.1% | |
| 368 | 1 | < 0.1% | |
| 367 | 1 | < 0.1% | |
| 366 | 1 | < 0.1% | |
| 365 | 1 | < 0.1% |
is_failed
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Value | Count | Frequency (%) | |
| 0 | 761833 | 96.9% | |
| 1 | 24767 | 3.1% |
voucher_amount
Real number (ℝ≥0)
| Distinct | 911 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09148909292 |
|---|---|
| Minimum | 0 |
| Maximum | 93.3989 |
| Zeros | 743462 |
| Zeros (%) | 94.5% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.686 |
| Maximum | 93.3989 |
| Range | 93.3989 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4795579176 |
|---|---|
| Coefficient of variation (CV) | 5.241694963 |
| Kurtosis | 3886.352852 |
| Mean | 0.09148909292 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 30.39394065 |
| Sum | 71965.32049 |
| Variance | 0.2299757963 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 743462 | 94.5% | |
| 1.029 | 11647 | 1.5% | |
| 1.715 | 11134 | 1.4% | |
| 2.058 | 9122 | 1.2% | |
| 0.686 | 3648 | 0.5% | |
| 1.372 | 1770 | 0.2% | |
| 2.744 | 1192 | 0.2% | |
| 2.5725 | 897 | 0.1% | |
| 3.43 | 543 | 0.1% | |
| 0.5145 | 373 | < 0.1% | |
| Other values (901) | 2812 | 0.4% |
| Value | Count | Frequency (%) | |
| 0 | 743462 | 94.5% | |
| 0.00343 | 35 | < 0.1% | |
| 0.28469 | 1 | < 0.1% | |
| 0.32242 | 1 | < 0.1% | |
| 0.343 | 19 | < 0.1% |
| Value | Count | Frequency (%) | |
| 93.3989 | 1 | < 0.1% | |
| 78.02907 | 1 | < 0.1% | |
| 68.3942 | 1 | < 0.1% | |
| 61.82575 | 1 | < 0.1% | |
| 37.57565 | 1 | < 0.1% |
delivery_fee
Real number (ℝ≥0)
| Distinct | 98 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1811799318 |
|---|---|
| Minimum | 0 |
| Maximum | 9.86 |
| Zeros | 597536 |
| Zeros (%) | 76.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.986 |
| Maximum | 9.86 |
| Range | 9.86 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3697095668 |
|---|---|
| Coefficient of variation (CV) | 2.040565769 |
| Kurtosis | 8.481347092 |
| Mean | 0.1811799318 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.417459196 |
| Sum | 142516.1343 |
| Variance | 0.1366851638 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 597536 | 76.0% | |
| 0.493 | 70617 | 9.0% | |
| 0.986 | 35735 | 4.5% | |
| 0.7395 | 34790 | 4.4% | |
| 0.2465 | 7664 | 1.0% | |
| 1.2325 | 7164 | 0.9% | |
| 1.479 | 6768 | 0.9% | |
| 1.4297 | 5078 | 0.6% | |
| 0.46835 | 3097 | 0.4% | |
| 0.4437 | 2657 | 0.3% | |
| Other values (88) | 15494 | 2.0% |
| Value | Count | Frequency (%) | |
| 0 | 597536 | 76.0% | |
| 0.02465 | 10 | < 0.1% | |
| 0.0493 | 3 | < 0.1% | |
| 0.0986 | 4 | < 0.1% | |
| 0.1479 | 303 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9.86 | 1 | < 0.1% | |
| 7.395 | 1 | < 0.1% | |
| 6.6555 | 1 | < 0.1% | |
| 6.409 | 1 | < 0.1% | |
| 5.916 | 1 | < 0.1% |
amount_paid
Real number (ℝ≥0)
| Distinct | 6471 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.18327131 |
|---|---|
| Minimum | 0 |
| Maximum | 1131.03 |
| Zeros | 872 |
| Zeros (%) | 0.1% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.5135 |
| Q1 | 6.64812 |
| median | 9.027 |
| Q3 | 12.213 |
| 95-th percentile | 19.5408 |
| Maximum | 1131.03 |
| Range | 1131.03 |
| Interquartile range (IQR) | 5.56488 |
Descriptive statistics
| Standard deviation | 5.6181212 |
|---|---|
| Coefficient of variation (CV) | 0.5517010233 |
| Kurtosis | 2243.912588 |
| Mean | 10.18327131 |
| Median Absolute Deviation (MAD) | 2.655 |
| Skewness | 15.5881411 |
| Sum | 8010161.21 |
| Variance | 31.56328582 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 5.31 | 14667 | 1.9% | |
| 7.965 | 14410 | 1.8% | |
| 6.372 | 11878 | 1.5% | |
| 8.496 | 10350 | 1.3% | |
| 6.903 | 9988 | 1.3% | |
| 5.841 | 9734 | 1.2% | |
| 9.027 | 9213 | 1.2% | |
| 7.434 | 9156 | 1.2% | |
| 10.62 | 8982 | 1.1% | |
| 9.558 | 8377 | 1.1% | |
| Other values (6461) | 679845 | 86.4% |
| Value | Count | Frequency (%) | |
| 0 | 872 | 0.1% | |
| 0.00531 | 1 | < 0.1% | |
| 0.01593 | 1 | < 0.1% | |
| 0.02655 | 1 | < 0.1% | |
| 0.03717 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1131.03 | 1 | < 0.1% | |
| 581.7105 | 1 | < 0.1% | |
| 363.01815 | 1 | < 0.1% | |
| 353.3805 | 1 | < 0.1% | |
| 246.88845 | 1 | < 0.1% |
restaurant_id
Real number (ℝ≥0)
| Distinct | 13569 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 162864079.3 |
|---|---|
| Minimum | 73498 |
| Maximum | 340453498 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 73498 |
|---|---|
| 5-th percentile | 29803498 |
| Q1 | 86023498 |
| median | 169613498 |
| Q3 | 228433498 |
| 95-th percentile | 302393498 |
| Maximum | 340453498 |
| Range | 340380000 |
| Interquartile range (IQR) | 142410000 |
Descriptive statistics
| Standard deviation | 87830821.23 |
|---|---|
| Coefficient of variation (CV) | 0.5392890906 |
| Kurtosis | -1.08595334 |
| Mean | 162864079.3 |
| Median Absolute Deviation (MAD) | 71240000 |
| Skewness | -0.02254910338 |
| Sum | 1.281088848e+14 |
| Variance | 7.714253157e+15 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 37623498 | 1317 | 0.2% | |
| 983498 | 1071 | 0.1% | |
| 192673498 | 1031 | 0.1% | |
| 154543498 | 999 | 0.1% | |
| 88773498 | 967 | 0.1% | |
| 146723498 | 942 | 0.1% | |
| 105253498 | 935 | 0.1% | |
| 18603498 | 922 | 0.1% | |
| 30633498 | 918 | 0.1% | |
| 29593498 | 882 | 0.1% | |
| Other values (13559) | 776616 | 98.7% |
| Value | Count | Frequency (%) | |
| 73498 | 120 | < 0.1% | |
| 123498 | 37 | < 0.1% | |
| 153498 | 193 | < 0.1% | |
| 173498 | 181 | < 0.1% | |
| 193498 | 84 | < 0.1% |
| Value | Count | Frequency (%) | |
| 340453498 | 1 | < 0.1% | |
| 340093498 | 2 | < 0.1% | |
| 340033498 | 1 | < 0.1% | |
| 339983498 | 2 | < 0.1% | |
| 339913498 | 1 | < 0.1% |
city_id
Real number (ℝ≥0)
| Distinct | 3749 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47179.7505 |
|---|---|
| Minimum | 230 |
| Maximum | 100205 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 230 |
|---|---|
| 5-th percentile | 10346 |
| Q1 | 24799 |
| median | 46467 |
| Q3 | 67886 |
| 95-th percentile | 89749 |
| Maximum | 100205 |
| Range | 99975 |
| Interquartile range (IQR) | 43087 |
Descriptive statistics
| Standard deviation | 25904.63056 |
|---|---|
| Coefficient of variation (CV) | 0.5490624747 |
| Kurtosis | -1.018564164 |
| Mean | 47179.7505 |
| Median Absolute Deviation (MAD) | 21419 |
| Skewness | 0.05185593619 |
| Sum | 3.711159174e+10 |
| Variance | 671049884.7 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 10346 | 86654 | 11.0% | |
| 20326 | 36210 | 4.6% | |
| 80562 | 34100 | 4.3% | |
| 50898 | 21627 | 2.7% | |
| 40441 | 16732 | 2.1% | |
| 60537 | 14760 | 1.9% | |
| 44366 | 14119 | 1.8% | |
| 45358 | 11246 | 1.4% | |
| 4334 | 11106 | 1.4% | |
| 90633 | 10449 | 1.3% | |
| Other values (3739) | 529597 | 67.3% |
| Value | Count | Frequency (%) | |
| 230 | 993 | 0.1% | |
| 1298 | 6519 | 0.8% | |
| 1676 | 77 | < 0.1% | |
| 1685 | 33 | < 0.1% | |
| 1689 | 18 | < 0.1% |
| Value | Count | Frequency (%) | |
| 100205 | 1 | < 0.1% | |
| 100079 | 1 | < 0.1% | |
| 100061 | 3 | < 0.1% | |
| 100048 | 56 | < 0.1% | |
| 99999 | 5 | < 0.1% |
payment_id
Real number (ℝ≥0)
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1668.509077 |
|---|---|
| Minimum | 1491 |
| Maximum | 1811 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 1491 |
|---|---|
| 5-th percentile | 1523 |
| Q1 | 1619 |
| median | 1619 |
| Q3 | 1779 |
| 95-th percentile | 1779 |
| Maximum | 1811 |
| Range | 320 |
| Interquartile range (IQR) | 160 |
Descriptive statistics
| Standard deviation | 87.19266546 |
|---|---|
| Coefficient of variation (CV) | 0.05225783105 |
| Kurtosis | -1.011622604 |
| Mean | 1668.509077 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.2658271582 |
| Sum | 1312449240 |
| Variance | 7602.56091 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=5)
| Value | Count | Frequency (%) | |
| 1619 | 476600 | 60.6% | |
| 1779 | 234133 | 29.8% | |
| 1491 | 36497 | 4.6% | |
| 1811 | 34492 | 4.4% | |
| 1523 | 4878 | 0.6% |
| Value | Count | Frequency (%) | |
| 1491 | 36497 | 4.6% | |
| 1523 | 4878 | 0.6% | |
| 1619 | 476600 | 60.6% | |
| 1779 | 234133 | 29.8% | |
| 1811 | 34492 | 4.4% |
| Value | Count | Frequency (%) | |
| 1811 | 34492 | 4.4% | |
| 1779 | 234133 | 29.8% | |
| 1619 | 476600 | 60.6% | |
| 1523 | 4878 | 0.6% | |
| 1491 | 36497 | 4.6% |
platform_id
Real number (ℝ≥0)
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29868.52938 |
|---|---|
| Minimum | 525 |
| Maximum | 30423 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 525 |
|---|---|
| 5-th percentile | 29463 |
| Q1 | 29463 |
| median | 29815 |
| Q3 | 30231 |
| 95-th percentile | 30359 |
| Maximum | 30423 |
| Range | 29898 |
| Interquartile range (IQR) | 768 |
Descriptive statistics
| Standard deviation | 1160.893265 |
|---|---|
| Coefficient of variation (CV) | 0.03886677012 |
| Kurtosis | 565.3036862 |
| Mean | 29868.52938 |
| Median Absolute Deviation (MAD) | 352 |
| Skewness | -22.53663783 |
| Sum | 2.349458521e+10 |
| Variance | 1347673.174 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=14)
| Value | Count | Frequency (%) | |
| 29463 | 241523 | 30.7% | |
| 30231 | 216726 | 27.6% | |
| 29815 | 158972 | 20.2% | |
| 30359 | 103653 | 13.2% | |
| 30391 | 24434 | 3.1% | |
| 29751 | 19321 | 2.5% | |
| 29495 | 11151 | 1.4% | |
| 30423 | 6819 | 0.9% | |
| 30199 | 2079 | 0.3% | |
| 525 | 1094 | 0.1% | |
| Other values (4) | 828 | 0.1% |
| Value | Count | Frequency (%) | |
| 525 | 1094 | 0.1% | |
| 22167 | 3 | < 0.1% | |
| 22263 | 232 | < 0.1% | |
| 22295 | 1 | < 0.1% | |
| 29463 | 241523 | 30.7% |
| Value | Count | Frequency (%) | |
| 30423 | 6819 | 0.9% | |
| 30391 | 24434 | 3.1% | |
| 30359 | 103653 | 13.2% | |
| 30231 | 216726 | 27.6% | |
| 30199 | 2079 | 0.3% |
transmission_id
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4253.246112 |
|---|---|
| Minimum | 212 |
| Maximum | 21124 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 212 |
|---|---|
| 5-th percentile | 4228 |
| Q1 | 4228 |
| median | 4324 |
| Q3 | 4356 |
| 95-th percentile | 4356 |
| Maximum | 21124 |
| Range | 20912 |
| Interquartile range (IQR) | 128 |
Descriptive statistics
| Standard deviation | 572.8556657 |
|---|---|
| Coefficient of variation (CV) | 0.1346866959 |
| Kurtosis | 176.6261099 |
| Mean | 4253.246112 |
| Median Absolute Deviation (MAD) | 32 |
| Skewness | -0.9114324558 |
| Sum | 3345603392 |
| Variance | 328163.6137 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 4356 | 341734 | 43.4% | |
| 4324 | 203668 | 25.9% | |
| 4228 | 201617 | 25.6% | |
| 4260 | 14538 | 1.8% | |
| 212 | 12676 | 1.6% | |
| 4996 | 6737 | 0.9% | |
| 4196 | 5276 | 0.7% | |
| 1988 | 207 | < 0.1% | |
| 21124 | 146 | < 0.1% | |
| 2020 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 212 | 12676 | 1.6% | |
| 1988 | 207 | < 0.1% | |
| 2020 | 1 | < 0.1% | |
| 4196 | 5276 | 0.7% | |
| 4228 | 201617 | 25.6% |
| Value | Count | Frequency (%) | |
| 21124 | 146 | < 0.1% | |
| 4996 | 6737 | 0.9% | |
| 4356 | 341734 | 43.4% | |
| 4324 | 203668 | 25.9% | |
| 4260 | 14538 | 1.8% |
is_returning_customer
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Value | Count | Frequency (%) | |
| 1 | 408889 | 52.0% | |
| 0 | 377711 | 48.0% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
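The calculation described above can be sketched in a few lines of standard-library Python. This is an illustration of the formula, not the code the profiling tool runs; the function name `pearson_r` and the sample values are hypothetical.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so unnormalised sums are used throughout.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical toy data.
x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
r = pearson_r(x, y)
# Shifting and rescaling y by a positive factor leaves r unchanged.
r_rescaled = pearson_r(x, [10 + 3 * v for v in y])
```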
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
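As a sketch of this definition, ρ can be obtained by ranking each variable and applying the Pearson formula to the ranks. The helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical, and giving tied values the average of their ranks is an assumption, though a common one.

```python
import math


def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        rank = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(x, y):
    """Pearson correlation of the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical toy data: y = x**2 is monotonic in x but not linear.
x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
```

On this toy input ρ reaches 1 while r stays below 1, which is the sense in which ρ picks up nonlinear monotonic relationships.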
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
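Taken literally, this description corresponds to the variant usually called tau-a, which can be sketched as below. The function name `kendall_tau_a` is hypothetical; the profiling tool may use a tie-corrected variant instead, so results can differ when the data contain ties. The loop visits every pair, so the cost grows quadratically with the number of observations.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1  # both variables move in the same direction
        elif product < 0:
            discordant += 1  # the variables move in opposite directions
        # product == 0 means a tie in x or y; it counts toward neither group.
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical toy data: one swapped pair out of six.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
```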
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for it.
First rows
| | customer_id | order_date | order_hour | customer_order_rank | is_failed | voucher_amount | delivery_fee | amount_paid | restaurant_id | city_id | payment_id | platform_id | transmission_id | is_returning_customer |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 000097eabfd9 | 2015-06-20 | 19 | 1.0 | 0 | 0.0 | 0.000 | 11.46960 | 5803498 | 20326 | 1779 | 30231 | 4356 | 0 |
| 1 | 0000e2c6d9be | 2016-01-29 | 20 | 1.0 | 0 | 0.0 | 0.000 | 9.55800 | 239303498 | 76547 | 1619 | 30359 | 4356 | 0 |
| 2 | 000133bb597f | 2017-02-26 | 19 | 1.0 | 0 | 0.0 | 0.493 | 5.93658 | 206463498 | 33833 | 1619 | 30359 | 4324 | 1 |
| 3 | 00018269939b | 2017-02-05 | 17 | 1.0 | 0 | 0.0 | 0.493 | 9.82350 | 36613498 | 99315 | 1619 | 30359 | 4356 | 0 |
| 4 | 0001a00468a6 | 2015-08-04 | 19 | 1.0 | 0 | 0.0 | 0.493 | 5.15070 | 225853498 | 16456 | 1619 | 29463 | 4356 | 0 |
| 5 | 0001d9036b5e | 2015-08-29 | 19 | 1.0 | 0 | 0.0 | 0.000 | 11.94750 | 193643498 | 88276 | 1619 | 29463 | 4356 | 0 |
| 6 | 0001d9036b5e | 2017-01-04 | 17 | 2.0 | 0 | 0.0 | 0.000 | 11.15100 | 193643498 | 88276 | 1619 | 29463 | 4356 | 0 |
| 7 | 0001d9036b5e | 2017-01-28 | 16 | 3.0 | 0 | 0.0 | 0.000 | 9.71730 | 193643498 | 88276 | 1619 | 30359 | 4356 | 0 |
| 8 | 0001e1e04d7d | 2015-10-24 | 19 | 1.0 | 0 | 0.0 | 0.000 | 25.22250 | 144833498 | 45358 | 1619 | 29463 | 4356 | 1 |
| 9 | 0001e1e04d7d | 2016-03-24 | 19 | 2.0 | 0 | 0.0 | 0.000 | 9.29250 | 95953498 | 45358 | 1619 | 29463 | 4324 | 1 |
Last rows
| | customer_id | order_date | order_hour | customer_order_rank | is_failed | voucher_amount | delivery_fee | amount_paid | restaurant_id | city_id | payment_id | platform_id | transmission_id | is_returning_customer |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 786590 | fffcf45e5c69 | 2016-11-19 | 12 | 1.0 | 0 | 0.0 | 0.0000 | 12.53160 | 107463498 | 39335 | 1619 | 29463 | 4356 | 0 |
| 786591 | fffcf45e5c69 | 2017-02-04 | 12 | 2.0 | 0 | 0.0 | 0.0000 | 11.57580 | 107463498 | 39335 | 1619 | 30359 | 4356 | 0 |
| 786592 | fffd696eaedd | 2015-09-14 | 12 | 1.0 | 0 | 0.0 | 1.4297 | 24.13395 | 95323498 | 80562 | 1779 | 29463 | 4356 | 0 |
| 786593 | fffe9d5a8d41 | 2016-07-31 | 21 | NaN | 1 | 0.0 | 0.0000 | 8.44290 | 156133498 | 10346 | 1811 | 29463 | 212 | 1 |
| 786594 | fffe9d5a8d41 | 2016-09-30 | 20 | 1.0 | 0 | 0.0 | 0.0000 | 10.72620 | 983498 | 10346 | 1779 | 29463 | 4228 | 1 |
| 786595 | fffe9d5a8d41 | 2016-09-30 | 20 | NaN | 1 | 0.0 | 0.0000 | 10.72620 | 983498 | 10346 | 1779 | 29463 | 212 | 1 |
| 786596 | ffff347c3cfa | 2016-08-17 | 21 | 1.0 | 0 | 0.0 | 0.0000 | 7.59330 | 52893498 | 41978 | 1619 | 30359 | 4356 | 1 |
| 786597 | ffff347c3cfa | 2016-09-15 | 21 | 2.0 | 0 | 0.0 | 0.0000 | 5.94720 | 164653498 | 41978 | 1619 | 30359 | 4356 | 1 |
| 786598 | ffff4519b52d | 2016-04-02 | 19 | 1.0 | 0 | 0.0 | 0.0000 | 21.77100 | 16363498 | 80562 | 1491 | 29751 | 4228 | 0 |
| 786599 | ffffccbfc8a4 | 2015-05-30 | 20 | 1.0 | 0 | 0.0 | 0.0000 | 16.46100 | 150293498 | 45952 | 1619 | 29463 | 4324 | 0 |